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Abstract. The relation of const ant- fact or approximability to fixed-parameter tractabil- 
ity and kernelization is a long-standing open question. We prove that two large classes of 
constant-factor approximable problems, namely MIN F^IIi and MAX NP, including the 
, well-known subclass MAX SNP, admit polynomial kernelizations for their natural decision 

versions. This extends results of Cai and Chen (JCSS 1997), stating that the standard pa- 
rameterizations of problems in MAX SNP and MIN F"'"IIi are fixed-parameter tractable. 



, and complements recent research on problems that do not admit polynomial kernelizations 
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(Bodlaender et al. ICALP 2008). 
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^ ' 1. Introduction 

The class AFX consists of all NP optimization problems that are approximable to within 
a constant factor of the optimum. It is known that the decision versions of most APX- 
problems are fixed-parameter tractable or even admit efficient preprocessing in the form of 
[ a polynomial kernelization. How strong is the relation between constant-factor approxima- 

' bility and polynomial kernelizability? Is there a property inherent to most APX-problems 

that explains this relation? What is the nature of APX-problems that do not admit a 
polynomial kernelization, such as Bin Packing for example? 

Since many prominent APX-problems are complete under approximation preserving 
5^ I reductions and do not admit arbitrarily small approximation ratios, studying their param- 

eterized complexity is a natural approach to obtain better results (recently Cai and Huang 
presented fixed-parameter approximation schemes for MAX SNP [7J). In conjunction with 
recent work on problems without polynomial kernelizations, positive answers to the ques- 
tions may provide evidence against APX-membership for some problems (e.g. Treewidth). 

Our work: We prove that the standard parameterizations of problems in two large classes of 
constant-factor approximable problems, namely MIN F"*"!!! and MAX NP, admit polyno- 
mial kernelizations. This extends results of Cai and Chen [6] who showed that the standard 
parameterizations of all problems in MIN F+Hi and MAX SNP (a subclass of MAX NP) 
are fixed-parameter tractableQ Interestingly perhaps, both our results rely on the Sunflower 
Lemma due to Erdos and Rado ^lOj . 



1998 ACM Subject Classification: F.2.2. 
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-'^The existence of a kernelization, not necessarily polynomial, is equivalent to fixed-parameter tractability. 
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Approximation ratio 


Kernel size 


Minimum Vertex Cover 
Feedback Vertex Set 
Minimum Fill-In 
Treewidth 


2tl5j 
2[3] 

o(opt) [12] 

o(viogopt) [n] 


o{k) m 

0{k^) [Ij 
0(A:2) [TO] 
not poly^ [5] 



Table 1: Approximation ratio and size of problem kernels for some optimization problems. 

Related work: Recently Bodlaender et al. [5j presented the first negative results concern- 
ing the existence of polynomial kernelizations for some natural fixed-parameter tractable 
problems. Using the notion of a distillation algorithm and results due to Fortnow and 
Santhanam [14], they were able to show that the existence of polynomial kernelizations 
for so-called compositional parameterized problems implies a collapse of the polynomial 
hierarchy to the third level. These are seminal results presenting the first super-linear 
lower bounds for kernelization and relating a statement from parameterized complexity to 
a hypothesis from classical complexity theory. 

In Tabled) we summarize approximability and kernelization results for some well-known 
problems. 

MIN F^IIi and MAX NP: Two decades ago Papadimitriou and Yannakakis [23] initi- 
ated the syntactic study of optimization problems to extend the understanding of approx- 
imability. They introduced the classes MAX NP and MAX SNP as natural variants of NP 
based on Fagin's [11] syntactic characterization of NP. Essentially problems are in MAX NP 
or MAX SNP if their optimum value can be expressed as the maximum number of tuples for 
which some existential, respectively quantifier-free, first-order formula holds. They showed 
that every problem in these two classes is approximable to within a constant factor of the 
optimum. Arora et al. complemented this by proving that no MAX SNP-complete problem 
has a polynomial-time approximation scheme, unless P=NP Contained in MAX SNP 
there are some well-known maximization problems, such as Max Cut, Max q-Sat, and 
Independent Set on graphs of bounded degree. Its superclass MAX NP also contains 
Max Sat amongst others. 

Kolaitis and Thakur generalized the approach of examining the logical definability 
of optimization problems and defined further classes of minimization and maximization 
problems \n\ I18| . Amongst others they introduced the class MIN F"^!!! of problems 
whose optimum can be expressed as the minimum weight of an assignment (i.e. number of 
ones) that satisfies a certain universal first-order formula. They proved that every problem 
in MIN F^IIi is approximable to within a constant factor of the optimum. In MIN F^IIi 
there are problems like Vertex Cover, d-HiTTiNC Set, and Triangle Edge Deletion. 

Section [2] covers the definitions of the classes MIN F^IIi and MAX NP, as well as the 
necessary details from parameterized complexity. In Sections [3] and H] we present polynomial 
kernelizations for the standard parameterizations of problems in MIN F'''ni and MAX NP 
respectively. Section [5] summarizes our results and poses some open problems. 



Treewidth does not admit a polynomial kernelization unless there is a distillation algorithm for all coNP 
complete problems [5]- Though unlikely, this is not known to imply a collapse of the polynomial hierarchy. 
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2. Preliminaries 

Logic and complexity classes: A (relational) vocabulary is a set a of relation sym- 
bols, each having some fixed integer as its arity. Atomic formulas over a are of the 
form R(zi,...,zt) where R is a i-ary relation symbol from a and the Zi are variables. 
The set of quantifier-free (relational) formulas over a is the closure of the set of all atomic 
formulas under negation, conjunction, and disjunction. 

Definition 2.1 (MIN F+IIi, MAX NP). A finite structure of type (ri,...,rt) is a tu- 
ple A = {A, Ri, . . . , Rt) where j4 is a finite set and each i?j is an r^-ary relation over A. 

Let Q be an optimization problem on finite structures of type (ri, . . . , rj). Let Ri, . . . ,Rt 
be relation symbols of arity ri, . . . ,rt. 

(a) The problem Q is contained in the class MIN F"*"!!! if its optimum on finite structures A 
of type (ri, . . . , rt) can be expressed as 

opt2(^) = mm{\S\ : {A, S) |= (Vx e ^^-) : ^'(x, S)}, 

where S" is a single relation symbol and ip{x, S) is a quantifier-free formula in conjunc- 
tive normal form over the vocabulary . . . ,Rt, S} on variables {xi, . . . , Xc^}. Further- 
more, ip{x,S) is positive in S, i.e. S does not occur negated in ip{x,S). 

(b) The problem Q is contained in the class MAX NP if its optimum on finite structures A 
of type (ri, . . . , rt) can be expressed as 

opt2(^) = max|{x G : {A,S) \= (3y e A^y) : V(x,y,5)}| , 

where S = {Si, . . . , Su) is a tuple of Sj-ary relation symbols Si and V'(x, y, S) is a quantifier- 
free formula in disjunctive normal form over the vocabulary {Ri, . . . , Rt, Si, . . . , Su} on 
variables {xi, . . . ,Xc.„yi, ■ ■ .,ycy}- 

Remark 2.2. The definition of MAX SNP is similar to that of MAX NP but without 
the existential quantification of y, i.e optg(^) = max^ |{x : {A,S) |= Tp(x,S)}\. 

Example 2.3 (Minimum Vertex Cover). Let G = {V, E) be a finite structure of type (2) 
that represents a graph by a set V of vertices and a binary relation E over V as its edges. 
The optimum of Minimum Vertex Cover on structures G can be expressed as: 

optyc(G) = min{|5| : (G, S) \= (V(n, v) G V"^) : {^E{u, v) V S{u) V S{v))}. 

This implies that Minimum Vertex Cover is contained in MIN F+IIi. 

Example 2.4 (Maximum Satisfiability). Formulas in conjunctive normal form can be 
represented by finite structures = (F, P, N) of type (2, 2): Let F be the set of all clauses 
and variables, and let P and N be binary relations over F. Let P{x, c) be true if and only 
if X is a literal of the clause c and let N{x, c) be true if and only if -ix is a literal of the 
clause c. The optimum of Max Sat on structures J- can be expressed as: 

optMsiT) = max|{c G F : {J',T) \= {3x G F) : {P{x,c) AT{x)) V {N{x,c) A -T(x))}|. 

Thus Max Sat is contained in MAX NP. 

For a detailed introduction to MIN F+IIi, MAX NP, and MAX SNP we refer the 
reader to [T71 [181 123] . An introduction to logic and complexity can be found in [22] . 

Parameterized complexity: The field of parameterized complexity, pioneered by Downey 
and Fellows, is a two-dimensional approach of coping with combinatorially hard problems. 
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Parameterized problems come with a parameterization that maps input instances to a pa- 
rameter value. The time complexity of algorithms is measured with respect to the input 
size and the parameter. In the following we give the necessary formal definitions, namely 
fixed-parameter tractability, standard parameterizations, and kernelization. 

Definition 2.5 (Fixed-parameter tractability). A parameterization of S* is a polynomial- 
time computable mapping k : S* — > N. A parameterized problem over an alphabet S is a 
pair (Q, k) consisting of a set Q C S* and a parameterization k of S*. 

A parameterized problem {Q,k) is fixed-parameter tractable if there exists an algo- 
rithm A, a polynomial p, and a computable function / : N ^ N such that A decides x £ Q 
in time f{K{x)) -p^xl). FPT is the class of all fixed-parameter tractable problems. 

Definition 2.6 (Standard parameterization). Let Q be a maximization (minimization) 
problem. The standard parameterization of Q is p-Q = {d-Q,K) where k : {A,k) i— > k 
and d-Q is the language of all tuples {A, k) such that optQ(^) > k (optQ(^) < k). 

Basically d-Q is the decision version of Q, asking whether the optimum is at least k 
(respectively at most k). The standard parameterization of Q is d-Q parameterized by k. 

Definition 2.7 (Kernelization). Let (Q, k) be a parameterized problem over S. A poly- 
nomial-time computable function K : T,* ^ Ti* is a kernelization of (Q, k) if there is a 
computable function /i : N — > N such that for all x G S* we have 

(x e Q 4^ K{x) G Q) and \K{x)\ < h{K{x)). 

We call h the size of the problem kernel K{x). The kernelization K is polynomial if h 
is a polynomial. We say that (Q,k) admits a (polynomial) kernelization if there exists a 
(polynomial) kernelization of (Q,k). 

Essentially, a kernelization is a polynomial-time data reduction that comes with a guar- 
anteed upper bound on the size of the resulting instance in terms of the parameter. 
For an introduction to parameterized complexity we refer the reader to [9l [T3l [20] . 

Hypergraphs and sunflowers: We assume the reader to be familiar with the basic graph 
notation. A hypergraph is a tuple Ti. = {V, E) consisting of a finite set V ^ its vertices, and 
a family E of subsets of V , its edges. A hypergraph has dimension d if each edge has 
cardinality at most d. A hypergraph is d-uniform if each edge has cardinality exactly d. 

Definition 2.8 (Sunflower). Let TC he a hypergraph. A sunflower of cardinality r is a 
set F = {/i, . . . , fr} of edges of Ti. such that every pair has the same intersection C, i.e. for 
all 1 < f < j < r: /j n fj = C. The set C is called the core of the sunflower. 

Note that any family of pairwise disjoint sets is a sunflower with core C = 0. 

Lemma 2.9 (Sunflower Lemma [10]). Let k,d £ N and let TC be a d-uniform hypergraph 
with more than {k — l)'^-dl edges. Then there is a sunflower of cardinality k inTi. For every 
fixed d there is an algorithm that computes such a sunflower in time polynomial in \E{7i)\. 

Corollary 2.10 (Sunflower Corollary). The same holds for d- dimensional hypergraphs with 
more than {k — 1)"^ ■ d\ ■ d edges. 

Proof. For some d' G {1, . . . ,d}, 7i has more than (/c — l)'^ ■ d\ > {k — 1)'^' ■ d'\ edges of 
cardinality d' . Let Hd' be the d'-uniform subgraph induced by the edges of cardinality d' . 
We apply the Sunflower Lemma on 7id' and obtain a sunflower F of cardinality k in time 
polynomial in \E{7id')\ < 1-^(^)1- Clearly F is also a sunflower of H. m 
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3. Polynomial kernelization for MIN F Hi 

We will prove that the standard parameterization of any problem in MIN F^IIi admits 
a polynomial kernelization. The class MIN F"*"!!! was introduced by Kolaitis and Thakur 
in a framework of syntactically defined classes of optimization problems [17j • In a follow-up 
paper they showed that every problem in MIN F'^'IIi is constant-factor approximable |18j . 

Throughout the section let Q £ MIN F'^'IIi be an optimization problem on finite 
structures of type (ri, . . . , rj). Let Ri, . . . , Rt be relation symbols of arity ri, . . . , and 
let 5 be a relation symbol of arity cs- Furthermore, let tp{x, S) be a quantifier- free formula 
in conjunctive normal form over the vocabulary {Ri, . . . ,Rt,S} on variables {xi, . . . , Xc^} 
that is positive in S such that 

opt2(^) = ^m^n^il^l : {A, S) \= (Vx e ^^^) : ^'(x, S)}. 

Let s be the maximum number of occurrences of S in any clause of ip{x,S). The 
standard parameterization p-Q of Q is the following problem: 
Input: A finite structure A of type (ri, . . . , rt) and an integer k. 

Parameter: k. 

Task: Decide whether optQ(^) < k. 

We will see that, given an instance {A, k), deciding whether optQ(^) < k is equivalent 
to deciding an instance of s-Hitting SetH Our kernelization will therefore make use of 
existing kernelization results for s-HiTTiNG Set. The parameterized version of s-Hitting 
Set is defined as follows: 

Input: A hypergraph TC = (V, E) of dimension s and an integer k. 

Parameter: k. 

Task: Decide whether Ti has a hitting set of size at most k, i.e. S" C y, \S\ < k, 

such that S has a nonempty intersection with every edge of TC. 

We consider the formula ip{x, S) and a fixed instance {A, k), with A = {A, . . . , Rt). 
For every tuple x G A'^'^ we can evaluate all literals of the form Ri{z) and -ii?j(z) for 
some z S {xi, . . . ,Xc^Y^. By checking whether z £ Ri, we obtain 1 (true) or (false) for 
each literal. Then we delete all occurrences of from the clauses and delete all clauses 
that contain a 1. For each x, we obtain an equivalent formula that we denote with V'x('S'). 
Each ip:x.{S) is in conjunctive normal form on literals S{z) for some z G {xi, . . . , Xc^}'^^ (no 
literals of the form -^S{z) since ip{:>i., S) is positive in S). 

Remark 3.1. For all x G and S Q A''^ it holds that {A, S) |= i^{x,S) if and only 
if {A, S) 1= ip:x_{S). Moreover, we can compute all formulas V'x('S') for x G ^^"^ in polynomial 
time, since Cx and the length of tp{x, S) are constants independent of A. 

Deriving a formula ■0x('S') can yield empty clauses. This happens when all liter- 
als Ri{-), ^Ri{-) in a clause are evaluated to and there are no literals S{-). In that 
case, no assignment S can satisfy the formula ip:x_{S), or equivalently S). Thus {A, k) is 
a no-instance. Note that clauses of 'i/'x(5') cannot contain contradicting literals since S) 
is positive in 5. 

Remark 3.2. From now on, we assume that all clauses of the formulas tl'y-iS) nonempty. 

We define a mapping <I> from finite structures A to hypergraphs 7i. Then we show that 
equivalent s-Hitting Set instances can be obtained in this way. 



'In literature the problem is often called d-HiTTiNG Set but we will need d = s. 
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Definition 3.3. Let A be an instance of Q. We define ^{A) := H witfi H = {V,E). We 
let E be the family of all sets e = {zi, . . . , Zp} such that (5'(zi) V • • • V S{zp)) is a clause of 
a V'x(5') for some x G A^"^ . We let y be the union of all sets e £ E. 

Remark 3.4. The hypergraphs TC obtained from the mapping $ have dimension s since 
each V'x('S') has at most s literals per clause. It follows from Remark 13.11 that ^{A) can be 
computed in polynomial time. 

The following lemma establishes that {A,k) and {T~i,k) = {^{A),k) are equivalent in 
the sense that {A, k) £ p-Q if and only if {7i, k) £ s-Hitting Set. 

Lemma 3.5. Let A = {A, Ri, . . . , Rt) be an instance of Q then for all S C ^"^s 

[A^ S) \= (Vx) : ^(x, S) if and only if S is a hitting set for TL = ^(A). 

Proof. Let H = ^{A) = {V,E) and let 5 C A^S; 

(A5)N(VxGA'=^):V(x,5) 

<^ (Vx G A^'^) : each clause of V'x(5') has a literal S{z) for which z £ S 
<^ S has a nonempty intersection with every set e £ E 

4^ S is a hitting set for (y,E). m 
Our kernelization will consist of the following steps: 

(1) Map the given instance {A, k) for p-Q to an equivalent instance {TC, k) = {^{A), k) 
for s-HiTTiNG Set according to Definition 13.31 and Lemma 13.51 

(2) Use a polynomial kernelization for s-Hitting Set on (7^, A;) to obtain an equivalent 
instance {7i',k) with size polynomial in k. 

(3) Use {TC' , k) to derive an equivalent instance {A' , k) of p-Q. That way we will be able 
to conclude that {A! , k) is equivalent to (W, A;) and hence also to {A, k). 

There exist different kernelizations for s-Hitting Set: one by Flum and Grohe [13] 
based on the Sunflower Lemma due to Erdos and Rado [10] , one by Nishimura et al. [21] via 
a generalization of the Nemhauser- Trotter kernelization for Vertex Cover, and a recent 
one by Abu-Khzam [1] based on crown decompositions. For our purposes of deriving an 
equivalent instance for p-Q, these kernelizations have the drawback of shrinking sets during 
the reduction. This is not possible for our approach since we would need to change the 
formula V'lx, S) to shrink the clauses. We prefer to modify Flum and Grohe's kernelization 
such that it uses only edge deletions. 

Theorem 3.6. There exists a polynomial kernelization of s-Hitting Set that, given an 
instance {TC,k), computes an instance {TC*,k) such that E{TC*) Q E{TC), TC* has 0{k^) 
edges, and the size of {TC*,k) is 0{k^) as well. 

Proof Let {TC,k) be an instance of s-Hitting Set, with TC = {V,E). If TC contains a 
sunflower F = {fi, . . . , fk+i} of cardinality k + 1 then every hitting set of TC must have a 
nonempty intersection with the core C of -F or with the k+1 disjoint sets fi\C, . . . , fk+i\C. 
Thus every hitting set of at most k elements must have a nonempty intersection with C. 

Now consider a sunflower F = {/i, . . . , fk+i, fk+2} of cardinality k+2 in TC and let TC' = 
(y, E \ {fk+2})- We show that the instances {TC, k) and {TC', k) are equivalent. Clearly every 
hitting set for Ti. is also a hitting set for TC' since E{TC') C E{TC). Let S Q V he a hitting set 
of size at most k for TC' . Since F \ {/fc+2} is a sunflower of cardinality k + 1 inTC', it follows 
that S has a nonempty intersection with its core C. Hence S has a nonempty intersection 
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with fk+2 5 C too. Thus 5 is a hitting set of size at most k for TC, implying that {TC, k) 
and (Ti.', k) are equivalent. 

We start with Ti.* = Ti. and repeat the following step while Ti* has more than (fc+l)^-s!-s 
edges. By the Sunflower Corollary we obtain a sunflower of cardinality k + 2 in Ti* in time 
polynomial in \E{Ti*)\. We delete an edge of the detected sunflower from the edge set of Ti* 
(thereby reducing the cardinality of the sunflower to fe + 1). Thus, by the argument from 
the previous paragraph, we maintain that (Ti, k) and {Ti*, k) are equivalent. 

Furthermore E{Ti*) C E{Ti) and Ti* has no more than {k + ly ■ si ■ s ^ 0{k^) edges. 
Since we delete an edge of Ti* in each step, there are 0{\E{Ti)\) steps, and the total time is 
polynomial in \E{Ti)\. Deleting all isolated vertices from Ti* yields a size of 0{s-k^) = 0{k^) 
since each edge contains at most s vertices. ■ 

The following lemma proves that every s-Hitting Set instance that is "sandwiched" 
between two equivalent instances must be equivalent to both. 

Lemma 3.7. Let {Ti,k) be an instance o/ s -Hitting Set and let (Ti*,k) be an equiva- 
lent instance with E{Ti*) C E{T<). Then for any Ti' with E{Ti*) C E{Ti') C E{Ti) the 
instance {Ti',k) is equivalent to {Ti,k) and {Ti*,k). 

Proof. Observe that hitting sets for Ti can be projected to hitting sets for Ti' (i.e. restricted 
to the vertex set of Ti') since E{Ti') C E{Ti). Thus if iji, k) is a yes-instance then {Ti' , k) 
is a yes-instance too. The same argument holds for {Ti' , k) and {Ti* , k). Together with the 
fact that {Ti,k) and {Ti*,k) are equivalent, this proves the lemma. ■ 

Now we are well equipped to prove that p-Q admits a polynomial kernelization. 

Theorem 3.8. Let Q E MIN F^IIi. The standard parameterization p-Q of Q admits a 
polynomial kernelization. 

Proof. Let {A, k) be an instance of p-Q. By Lemma [3.5l we have that {A, k) is a yes-instance 
of p-Q if and only if {Ti,k) = {^{A),k)) is a yes-instance of s-Hitting Set. We apply 
the kernelization from Theorem 13.61 to (W, k) and obtain an equivalent s-Hitting Set 
instance {Ti* , k) such that E{Ti*) C E{ri) and Ti* has 0{k') edges. 

Recall that every edge oiTi, say {zi, . . . , Zp}, corresponds to a clause (S'(zi) V- • •VS'(zp)) 
of ip:x.{S) for some x G A^"^ . Thus for each edge e € E{Ti*) C E{Ti) we can select a tuple Xg 
such that e corresponds to a clause of V'xe('S')- Let X be the set of the selected tuples Xg for 
all edges e £ E{Ti*). Let A' <^ A he the set of all components of tuples Xg G X, ensuring 
that X C A'"^. Let R'^ be the restriction of Ri to A' and let A' = {A', R'^, . . . , R't). 

Let {Ti',k) = {^{A'),k). By definition of ^ and by construction of Ti' we know 
that E{Ti*) C E{Ti') C E{Ti) since X C A""- and A' C A. Thus, by Lemma EZl we 
have that {Ti',k) is equivalent to {Ti,k). Furthermore, by Lemma \3.5\ {Ti',k) is a yes- 
instance of s-Hitting Set if and only if {A',k) is a yes-instance of p-Q. Thus {A',k) 
and {A,k) are equivalent instances of p-Q. 

We conclude the proof by giving an upper bound on the size of {A' , k) that is polynomial 
in k. The set X contains at most \E{Ti*)\ £ 0{k^) tuples. These tuples have no more 
than Cx ■ \E{Ti*)\ different components. Hence the size of A' is 0{cx ■ A;*) = 0{k^). Thus 
the size of {A' , k) is ©(A;*""), where m is the largest arity of a relation Ri. The values Cx, s, 
and m are constants that are independent of the input {A, k). Thus {A' , k) is an instance 
equivalent to {A, k) with size polynomial in k. m 
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4. Polynomial kernelization for MAX NP 

We prove that the standard parameterization of any problem in MAX NP admits 
a polynomial kernelization. The class MAX NP was introduced by Papadimitriou and 
Yannakakis in ^23j. They showed that every problem in MAX NP is constant-factor ap- 
proximable. 

Throughout the section let Q £ MAX NP be an optimization problem on finite 
structures of type (ri, . . . , rj). Let Ri, . . . , Rt be relation symbols of arity ri, . . . , rt and 
let S = {Si, . . . , Su) be a tuple of relation symbols of arity si, . . . , Su- Let ijj{x,y,S) be 
a formula in disjunctive normal form over the vocabulary {Ri, . . . , Rt, Si, . . . , Su} on vari- 
ables {xi, . . . , Xc^,yi, . . . , ycy} such that for all finite structures A of type (ri, . . . , rt): 

optQiA) = max|{x E A^^ : iA,S) |= (3y e A^y) : V(x,y,5)}|. 

Let s be the maximum number of occurrences of relations Si,...,Su in any disjunct 
of ^{x,y,S). The standard parameterization p-Q of Q is the following problem: 

Input: A finite structure A of type (ri, . . . , rt) and an integer k. 

Parameter: k. 

Task: Decide whether optQ(^) > k. 

Similarly to the previous section, we consider the formula 'ip{x,'y,S) and a fixed in- 
stance {A, k) with A = {A, Ri, . . . , Rt). We select tuples x E A'^'^ and y S A'^v and evaluate 
all literals of the form Ri{z) and -ii?i(z) for some z E {xi, . . . , Xx^,yi, • • • , UcyY'- By check- 
ing whether z £ Ri we obtain 1 (true) or (false) for each literal. Since ip{x,'y,S) is in 
disjunctive normal form, we delete all occurrences of 1 from the disjuncts and delete all 
disjuncts that contain a 0. Furthermore, we delete all disjuncts that contain contradicting 
literals Sj{z),^Sj{z) since they cannot be satisfied. We explicitly allow empty disjuncts 
that are satisfied by definition for the sake of simplicity (they occur when all literals in a 
disjunct are evaluated to 1). We obtain an equivalent formula that we denote with ^^^y{S). 

Remark 4.1. For all x, y, and S it holds that {A,S) |= '(/'(x, y,5) iff {A,S) |= Vx,y('5). 
Moreover, we can compute all formulas V'x,y('5) for x E A'^'^ , y E A'^y in polynomial time, 
since Cx, Cy, and the length of Tp(x,y,S) are constants independent of A. 

Definition 4.2. Let A = {A, Ri, . . . , Rt) be a finite structure of type (ri, . . . , rt). 

(a) We define C A^"^ as the set of all tuples x such that (3y) : ^/>x,y('5) holds for some S: 

= {x : (35) : iA,S) |= (3y) : ^P^,y{S)}. 

(b) For X E A*^^ we define Yj[{x) as the set of all tuples y such that V'x,y('5) holds for 
some S: 

y4(x) = {y:(35):(A5)|=Vx,y(5)}. 

Remark 4.3. The sets Xj( and Y_a{x.) can be computed in polynomial time since the 
number of tuples x E A^"^ and y E A^y is polynomial in the size of A and ip{x,y,S) is of 
constant length independent of A. 

Lemma 4.4. Let {A,k) be an instance of p-Q. If\Xj^\ > k-2^ then optQ{A) > k, i.e. {A,k) 
is a yes-instance. 

Proof. The lemma can be concluded from the proof of the constant-factor approximability of 
problems in MAX NP in [23j. For each x E we fix a tuple y E ^(x) such that ^x,y('5) 
is satisfiable. This yields \Xj[\ formulas, say ^i, . . . ,^p\XJ^\■ Papadimitriou and Yannakakis 
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showed that one can efficiently compute an assignment that satisfies at least ^ of these 
formulas, where fi is the fraction of all assignments that satisfies ipi. 

To see that fi > 2~*; consider such a formula ipi. Since ipi is satisfiable there exists a 
satisfiable disjunct. To satisfy a disjunct of at most s literals, at most s variables need to 
be assigned accordingly. Since the assignment to all other variables can be arbitrary this 
implies that fi > 2~^. Thus we have that J2 fi — \-^a \ ' Therefore \X_a\ > implies 
that the assignment satisfies at least k formulas, i.e. that optQ(^) > k. m 

Henceforth we assume that \Xj[\ < k ■ 2'^ . 

Definition 4.5. Let {A, k) be an instance of p-Q with A = {A,Ri,. . . , Rt). For x G A'^'^ 
we define D^{^) as the set of all disjuncts of V'x,y('5) for y e Y^{ii). 

Definition 4.6. We define the intersection of two disjuncts as the conjunction of all literals 
that occur in both disjuncts. A sunflower of a set of disjuncts is a subset such that each pair 
of disjuncts in the subset has the same intersection (modulo permutation of the literals). 

Remark 4.7. The size of each L'y^(x) is bounded by the size of Xa(x) C A'^y times the 
number of disjuncts of V'(x,y,5) which is a constant independent of A. Thus the size of 
each D_4(x) is bounded by a polynomial in the input size. The definition of intersection 
and sunflowers among disjuncts is a direct analog that treats disjuncts as sets of literals. 

Definition 4.8. A partial assignment is a set L of literals such that no literal is the negation 
of another literal in L. A formula is satisfiable under L if there exists an assignment that 
satisfies the formula and each literal in L. 

Proposition 4.9. Let {A, k) he an instance of p-Q. For each x G A'^'^ there exists a 
set L'^(x) C D_4(x) of cardinality 0{k^) such that: 

(1) For every partial assignment L of at most sk literals, D*^(x) contains a disjunct 
satisfiable under L, if and only if Dj({x.) contains a disjunct satisfiable under L. 

(2) Z)^(x) can be computed in time polynomial in \ A\. 

Proof. Let A = {A, Ri, . . . , Rt) be a finite structure of type (ri, . . . , r^), let x G A'^'' , and 
let Dj[{x) be a set of disjuncts according to Definition l4.5l From the Sunflower Corollary we 
can derive a polynomial-time algorithm that computes a set L'^(x) by successively shrinking 
sunflowers. We start by setting D^{x) = L'^(x) and apply the following step while the 
cardinality of L'^(x) is greater than {sk + 1)* • s! • s. 

We compute a sunflower of cardinality sk + 2, say F = {/i, . . . , fsk+2}^ in time polyno- 
mial in |D^(x)| (Sunflower Corollary). We delete a disjunct of F, say fsk+2, from D^(x). 
Let O and P be copies of D*^ before respectively after deleting fsk+2- Observe that F' = 
F \ {fsk+2} is a sunflower of cardinality sk + 1 in P. Let L be a partial assignment of at 
most sk literals and assume that no disjunct in P is satisfiable under L. This means that for 
each disjunct of P there is a literal in L that contradicts it, i.e. a literal that is the negation 
of a literal in the disjunct. We focus on the sunflower F' in P. There must be a literal 
in L, say /, that contradicts at least two disjuncts of F\ say / and /', since \F'\ = sk + \ 
and \L\ < sk. Therefore / is the negation of a literal in the intersection of / and /', i.e. the 
core of F' . Thus / contradicts also fsk+2 and we conclude that no disjunct in O = PD{fsk+2} 
is satisflable under the partial assignment L. The reverse argument holds since all disjuncts 
of P are contained in O. Thus each step maintains the desired property (1). 
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At the end D'^{x) contains no more than {sk + 1)^ • s! • s G 0{k^) disjuncts. For each x 
this takes time polynomial in the size of the input since the cardinality of Dyx{x.) is bounded 
by a polynomial in the input size and a disjunct is deleted in each step. ■ 

Lemma 4.10. Let i^^(x) be a subset of D_a{x) such that D^{x) C Z)^(x) C D_4(x). For 
any partial assignment L of at most sk literals it holds that D^(x) contains a disjunct 
satisfiable under L if and only if D'_^{x) contains a disjunct satisfiable under L. 

Proof. Let L be a partial assignment of at most sk literals. If D_4(x) contains a disjunct 
satisfiable under L, then, by Proposition 14.91 this holds also for Z)^(x). For L)^(x) and D'j^ 
this holds since L)^(x) C D^(x). The same is true for Z)^(x) and I?_4(x). ■ 

Theorem 4.11. Let Q G MAX NP. The standard parameterization p-Q of Q admits a 
polynomial kernelization. 

Proof. The proof is organized in three parts. First, given an instance (A, k) of p-Q, we 
construct an instance {A' , k) of p-Q in time polynomial in the size of {A, k). In the second 
part, we prove that {A, k) and {A', k) are equivalent. In the third part, we conclude the 
proof by showing that the size of {A! , k) is bounded by a polynomial in k. 
(I.) Let {A,k) be an instance of p-Q. We use the sets -D^(x) and -D^(x) according to 
Definition 14.51 and Proposition 14.91 Recall that L)^(x) is the set of all disjuncts of '0x,y('5) 
for y G y4(x). Thus, for each disjunct d G -D^(x) C L)^(x), we can select a G ^^^^(x) 
such that d is a disjunct of ■i/'x,y^(5). Let ^_4(x) C y4(x) be the set of these selected 
tuples Yd. Let -D^(x) be the set of all disjuncts of 'i/'x,y('5) for y G ^^(x). Since i?^(x) 
contains some disjuncts of ^/'x.yl'?) for y G ^(x) and -D^(x) contains all disjuncts of •i/'x,y('5) 
for y G y4(x) D yji(x), we have that D\{y.) C L>^(x) C ^^(x). 

For each x this takes time 0{\D\{yi)\ ■ \YX{yi)\) 0{k^ ■ \ A['y). Computing y^(x) for 
all X G A'^^ takes time ©(LAI'^^ • fc^ • i.e. time polynomial in the size of {A, k) since k 

is never larger than |^|^^o 

Let C j4 be the set of all components of x G and y G ^(x) for all x G X_a. This 
ensures that C [A'Y^ and ^ji(x) C [A'yy for all x G X_a. Let R'^ be the restriction 
of Ri to A' and let A' = {A', R[,...,R[). 

(II.) We will now prove that optg(^) > A: if and only if optg(^') > k, i.e. that {A,k) 
and {A',k) are equivalent. Assume that optg(^) > k and let S = {Si, . . . , Su) such 
that |{x : {A,S) |= (3y) : ^(x, y, > k. This implies that there must exist tu- 
ples xi, . . . ,Xfc G A^"^ and yi, . . . ,yA: G A^y such that S satisfies ^x^.y, (5) for z = 1, . . . , k. 
Thus S must satisfy at least one disjunct in each V'xi,yi('5) since these formulas are in 
disjunctive normal form. Accordingly let di,. . . ,dk be disjuncts such that S satisfies the 
disjunct di in ip:x.i,yi{<S) for i = 1, . . . , k. We show that there exists S' such that: 

|{x:(^',5')|=(3y):V(x,y,5')}|>A:. 

For p = 1, . . . ,k we apply the following step: If y^ G YX{xp) then do nothing. Oth- 
erwise consider the partial assignment L consisting of the at most sk literals of the dis- 
juncts di, . . . , dk- The set D^(xp) contains a disjunct that is satisfiable under L, namely dp. 
By Lemma [4.10| it follows that D^(xp) also contains a disjunct satisfiable under L, say dp. 
Let y'p G YX{xp) such that dp is a disjunct of V'xp,yj,('5). Such a y'p can be found by selection 
of L'^(xp). Change S in the following way to satisfy the disjunct dp. For each literal of dp 
of the form S'j(z) add z to the relation Si. Similarly for each literal of the form -iS'j(z) 

^That is, (.4, A;) is a no- instance if k > \A\'^'' since k exceeds the number of tuples x € A"^^ . 
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remove z from Si. This does not change the fact that S satisfies the disjunct di in 'ipXi,yi{'S) 
for: i = 1, . . . , k since, by selection, dp is satisfiable under L. Then we replace by 
and dp by dp. Thus we maintain that S satisfies di in V'xi,yi('5) for i = 1, . . . ,k. 

After these steps we obtain S as well as tuples xi, . . . , x^, yi, • • • , y^ with yj € yjj(xj), 
and disjuncts di, . . . ,dk such that S satisfies di in ip:x.^,y,{S) for i = 1, . . . , fc. Let S' be the 
restriction of S to A'. Then we have that (A' ,S') |= ipyci,yiiS') for i = 1, . . . ,k since A' is 
defined to contain the components of tuples x G X_4 and of all tuples y G ^(x) for x E 
Hence x^ G {x : (^',5') ^ (3y) : V'(x,y,cS')} for i = 1, . . . , k. Thus optQ(^') > /c. 

For the reverse direction assume that optg(^') > k. Since A' <^ A it follows that 

{x : {A',S') 1= (3y) : V(x,y,cS')} Q {x : {A,S') \= (3y) : V(x,y,cS')}. 

Thus |{x : {A,S') 1= (3y) : V(x,y,cS')}| > k, implying that optQ(^) > k. There- 
fore optg(^) > A; if and only if optg(^') > k. Hence {A,k) and {A',k) are equivalent 
instances of p-Q. 

(HI.) We conclude the proof by providing an upper bound on the size of (A' , k) that is 
polynomial in k. For the sets 5^(x) we selected one tuple y for each disjunct in D^(x). 
Thus |y^(x)| < |L'*(x)| G 0{k'^) for all x G The set A' contains the components of 
tuples X G X_A. and of all tuples y G ^ji(x) for x G X_a. Thus 

< ■ \Xa\ + Cy ■ \Xa\ ■ 0{k') 

< c^-k-2' + cyk-2' ■0{k') = 0{k'+^). 

For each relation i?' we have \R'i\ < |yl'|^» G 0{k^'+^>^). Thus the size of [A', k) is 
bounded by 0(A;^*+^)™), where m is the largest arity of a relation ■ 

Remark 4.12. For MAX SNP one can prove a stronger result that essentially relies on 
Lemma [4.41 That way one obtains bounds for the sizes of A' and {A\ k) of 0{k) and 0{k'"'') 
respectively. 

5. Conclusion 

We have constructively established that the standard parameterizations of problems 
in MIN F'''ni and MAX NP admit polynomial kernelizations. Thus a strong relation be- 
tween constant-factor approximability and polynomial kernelizability has been showed for 
two large classes of problems. It remains an open problem to give a more general result 
that covers all known examples (e.g. Feedback Vertex Set). It might be profitable to 
consider closures of MAX SNP under reductions that preserve constant-factor approxima- 
bility. Khanna et al. [16j proved that APX and APX-PB are the closures of MAX SNP 
under PTAS-preserving reductions and E-reductions, respectively. Since both classes con- 
tain Bin Packing which does not admit a polynomial kernelization, this leads to the 
question whether polynomial kernelizability or fixed-parameter tractability are maintained 
under restricted versions of these reductions. 

Acknowledgement 

We would like to thank lyad Kanj for pointing out Bin Packing as an example that 
is constant-factor approximable but does not admit a polynomial kernelization. 



612 



S. KRATSCH 



References 

[1] Faisal N. Abu-Khzam. Kernelization algorithms for d-hitting set problems. In Frank K. H. A. Dehne, 
Jorg-Riidiger Sack, and Norbert Zeh, editors, WADS, volume 4619 of LNCS, pages 434-445. Springer, 
2007. 

[2] Sanjeev Arora, Carsten Lund, Rajeev Motwani, Madhu Sudan, and Mario Szegedy. Proof verification 
and the hardness of approximation problems. J. ACM, 45(3):501-555, 1998. 

[3] Vineet Bafna, Piotr Berman, and Toshihiro Fujito. A 2-approximation algorithm for the undirected 
feedback vertex set problem. SIAM J. Discrete Math., 12(3):289-297, 1999. 

[4] Ifans L. Bodlaender. A cubic kernel for feedback vertex set. In Wolfgang Thomas and Pascal Weil, 
editors, STACS, volume 4393 of LNCS, pages 320-331. Springer, 2007. 

[5] Hans L. Bodlaender, Rodney G. Downey, Michael R. Fellows, and Danny Hermelin. On problems 
without polynomial kernels (extended abstract). In Luca Aceto, Ivan Damgard, Leslie Ann Goldberg, 
Magnus M. Halldorsson, Anna Ingolfsdottir, and Igor Walukiewicz, editors, ICALP (1), volume 5125 
of LNCS, pages 563-574. Springer, 2008. 

[6] Liming Cai and Jianer Chen. On fixed-parameter tractability and approximability of NP optimization 
problems. J. Comput. Syst. Set., 54(3):465-474, 1997. 

[7] Liming Cai and Xiuzhen Huang. Fixed-parameter approximation: Conceptual framework and approx- 
imability results. In Hans L. Bodlaender and Michael A. Langston, editors, IWPEC, volume 4169 of 
Lecture Notes in Computer Science, pages 96-108. Springer, 2006. 

[8] Jianer Chen, lyad A. Kanj, and Weijia Jia. Vertex cover: Further observations and further improve- 
ments. J. Algorithms, 41(2):280-301, 2001. 

[9] Rod G. Downey and M. R. Fellows. Parameterized Complexity (Monographs in Computer Science). 
Springer, November 1998. 

[10] Paul Erdos and Richard Rado. Intersection theorems for systems of sets. J. London Math. Soc, 35:85-90, 
1960. 

[11] Ronald Fagin. Generalized first-order spectra and polynomial-time recognizable sets. In Complexity of 
Computation, volume 7 of SIAM-AMS Proceedings, pages 43-73, SIAM, Philadelphia, PA, 1974. 

[12] Uriel Feige, MohammadTaghi Hajiaghayi, and James R. Lee. Improved approximation algorithms for 
minimum weight vertex separators. SIAM J. Comput., 38(2):629-657, 2008. 

[13] J. Flum and M. Grohe. Parameterized Complexity Theory (Texts in Theoretical Computer Science. An 
EATCS Series). Springer, March 2006. 

[14] Lance Fortnow and Rahul Santhanam. Infeasibility of instance compression and succinct PCPs for NP. 
In Richard E. Ladner and Cynthia Dwork, editors, STOC, pages 133-142. ACM, 2008. 

[15] Eran Halperin. Improved approximation algorithms for the vertex cover problem in graphs and hyper- 
graphs. SIAM J. Comput, 31(5):1608-1623, 2002. 

[16] Sanjeev Khanna, Rajeev Motwani, Madhu Sudan, and Umesh V. Vazirani. On syntactic versus compu- 
tational views of approximability. SIAM J. Comput., 28(1):164-191, 1998. 

[17] Phokion G. Kolaitis and Madhukar N. Thakur. Logical definability of NP optimization problems. Inf. 
Comput, 115(2):321-353, 1994. 

[18] Phokion G. Kolaitis and Madhukar N. Thakur. Approximation properties of NP minimization classes. 
J. Comput Syst Sci., 50(3):391-411, 1995. 

[19] Assaf Natanzon, Ron Shamir, and Roded Sharan. A polynomial approximation algorithm for the min- 
imum fill-in problem. SIAM J. Comput, 30(4): 1067-1079, 2000. 

[20] Rolf Niedermeier. Invitation to Fixed Parameter Algorithms (Oxford Lecture Series in Mathematics and 
Its Applications). Oxford University Press, USA, March 2006. 

[21] Naomi Nishimura, Prabhakar Ragde, and Dimitrios M. Thilikos. Smaller kernels for hitting set problems 
of constant arity. In Rodney G. Downey, Michael R. Fellows, and Frank K. H. A. Dehne, editors, IWPEC, 
volume 3162 of LNCS, pages 121-126. Springer, 2004. 

[22] Christos H. Papadimitriou. Computational Complexity. Addison Wesley, November 1993. 

[23] Christos H. Papadimitriou and Mihalis Yannakakis. Optimization, approximation, and complexity 
classes. J. Comput. Syst. Sci., 43(3):425-440, 1991. 



This work is licensed under tine Creative Commons Attribution-NoDerivs License. To view a 
copy of tlliS license, visit http : //creativecommons . org/licenses/by-nd/3 . 0/ 



